Towards Efficient Deep Inference for Mobile Applications
نویسنده
چکیده
Mobile applications are beneting signicantly from the advancement in deep learning, e.g. providing new features. Given a trained deep learning model, applications usually need to perform a series of matrix operations based on the input data, in order to infer possible output values. Because of model computation complexity and increased model sizes, those trained models are usually hosted in the cloud. When mobile apps need to utilize those models, they will have to send input data over the network. While cloud-based deep learning can provide reasonable response time for mobile apps, it also restricts the use case scenarios, e.g. mobile apps need to have access to network. With mobile specic deep learning optimizations, it is now possible to employ device-based inference. However, because mobile hardware, e.g. GPU and memory size, can be very dierent and limited when compared to desktop counterpart, it is important to understand the feasibility of this new device-based deep learning inference architecture. In this paper, we empirically evaluate the inference eciency of three Convolutional Neural Networks using a benchmark Android application we developed. Based on our application-driven analysis, we have identied several performance bolenecks for mobile applications powered by on-device deep learning inference. ACM Reference format: Tian Guo. 2017. Towards Ecient Deep Inference for Mobile Applications. In Proceedings of arxiv, 2017, July 2017 (arxiv’17), 7 pages. DOI: 10.1145/nnnnnnn.nnnnnnn
منابع مشابه
JointDNN: An Efficient Training and Inference Engine for Intelligent Mobile Cloud Computing Services
Deep neural networks are among the most influential architectures of deep learning algorithms, being deployed in many mobile intelligent applications. End-side services, such as intelligent personal assistants (IPAs), autonomous cars, and smart home services often employ either simple local models or complex remote models on the cloud. Mobile-only and cloud-only computations are currently the s...
متن کاملImproving Mobile Grid Performance Using Fuzzy Job Replica Count Determiner
Grid computing is a term referring to the combination of computer resources from multiple administrative domains to reach a common computational platform. Mobile Computing is a Generic word that introduces using of movable, handheld devices with wireless communication, for processing data. Mobile Computing focused on providing access to data, information, services and communications anywhere an...
متن کاملImproving Mobile Grid Performance Using Fuzzy Job Replica Count Determiner
Grid computing is a term referring to the combination of computer resources from multiple administrative domains to reach a common computational platform. Mobile Computing is a Generic word that introduces using of movable, handheld devices with wireless communication, for processing data. Mobile Computing focused on providing access to data, information, services and communications anywhere an...
متن کاملStressedNets: Efficient Feature Representations via Stress-induced Evolutionary Synthesis of Deep Neural Networks
The computational complexity of leveraging deep neural networks for extracting deep feature representations is a significant barrier to its widespread adoption, particularly for use in embedded devices. One particularly promising strategy to addressing the complexity issue is the notion of evolutionary synthesis of deep neural networks, which was demonstrated to successfully produce highly effi...
متن کاملUsing Mobile Phone Applications in Teaching and Learning Process
This quantitative, qualitative study investigates the usage of mobile phone applications in teaching and learning processes. The study aims to identify the benefits, difficulties, and resolutions of using mobile phone applications. The study was conducted in the English Department at Hebron University at the second semester of the academic years 2015/2016. The study focuses on the Business Engl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1707.04610 شماره
صفحات -
تاریخ انتشار 2017